Search results for "cluster [track data analysis]"

showing 10 items of 1171 documents

Global emergence of the widespread Pseudomonas aeruginosa ST235 clone

2018

Abstract. Objectives: Despite the non-clonal epidemic population structure of Pseudomonas aeruginosa, several multi-locus sequence types are distributed worldwide and are frequently associated with epidemics where multidrug resistance confounds treatment. ST235 is the most prevalent of these widespread clones. In this study we aimed to understand the origin of ST235 and the molecular basis for its success. Methods: The genomes of 79 P. aeruginosa ST235 isolates collected worldwide over a 27-year period were examined. A phylogenetic network was built, using a Bayesian approach to find the Most Recent Common Ancestor, and we identified antibiotic resistance determinants and ST235-specific genes…

0301 basic medicine; Most recent common ancestor; Clone (cell biology); [ SDV.MP.BAC ] Life Sciences [q-bio]/Microbiology and Parasitology/Bacteriology; medicine.disease_cause; Global Health; Genome; [ SDV.MP ] Life Sciences [q-bio]/Microbiology and Parasitology; Prevalence; Cluster Analysis; [ SDV.BIBS ] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]; High-risk clones; Phylogeny; ComputingMilieux_MISCELLANEOUS; Molecular Epidemiology; General Medicine; 3. Good health; Infectious Diseases; [SDV.MP]Life Sciences [q-bio]/Microbiology and Parasitology; [INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA]; [ SDV.BBM.GTP ] Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]; Pseudomonas aeruginosa; Efflux; [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]; Fluoroquinolones; Microbiology (medical); Genotype; 030106 microbiology; Epidemic; [INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE]; Biology; Bacterial resistance; Microbiology; [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing; Evolution Molecular; 03 medical and health sciences; [INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]; Antibiotic resistance; Drug Resistance Bacterial; medicine; Pseudomonas Infections; Gene; Pseudomonas aeruginosa; Pathogen; International clones; [INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation; Multiple drug resistance; Genes Bacterial; [INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET]; Multilocus Sequence Typing
researchProduct

Toward a direct and scalable identification of reduced models for categorical processes.

2017

The applicability of many computational approaches depends on the identification of reduced models defined on a small set of collective variables (colvars). A methodology for scalable probability-preserving identification of reduced models and colvars directly from the data is derived, without relying on the availability of the full relation matrices at any stage of the resulting algorithm, allowing for a robust quantification of reduced model uncertainty and allowing us to impose a priori available physical information. We show two applications of the methodology: (i) to obtain a reduced dynamical model for a polypeptide dynamics in water and (ii) to identify diagnostic rules from a standar…

0301 basic medicine; Multidisciplinary; business.industry; Computer science; Dimensionality reduction; Bayesian inference; Machine learning; computer.software_genre; 01 natural sciences; Reduction (complexity); 010104 statistics & probability; 03 medical and health sciences; Identification (information); 030104 developmental biology; Physical information; Physical Sciences; A priori and a posteriori; Artificial intelligence; Data mining; 0101 mathematics; Cluster analysis; business; Categorical variable; computer; Proceedings of the National Academy of Sciences of the United States of America
researchProduct

A clustering package for nucleotide sequences using Laplacian Eigenmaps and Gaussian Mixture Model.

2018

In this article, a new Python package for nucleotide sequence clustering is proposed. This package, freely available online, implements a Laplacian eigenmap embedding and a Gaussian Mixture Model for DNA clustering. It takes nucleotide sequences as input, and produces the optimal number of clusters along with a relevant visualization. Although we did not optimise the computational speed, our method still performs reasonably well in practice. Our focus was mainly on data analytics and accuracy and as a result, our approach outperforms the state of the art, even in the case of divergent sequences. Furthermore, an a priori knowledge on the number of clust…

0301 basic medicine; Nematoda; 01 natural sciences; Gaussian Mixture Model; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]; ComputingMilieux_MISCELLANEOUS; computer.programming_language; [STAT.AP]Statistics [stat]/Applications [stat.AP]; Phylogenetic tree; DNA Clustering; Genomics; Helminth Proteins; Computer Science Applications; [STAT]Statistics [stat]; 010201 computation theory & mathematics; [INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA]; Data analysis; Embedding; A priori and a posteriori; [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]; Health Informatics; 0102 computer and information sciences; [INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE]; Biology; [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing; 03 medical and health sciences; [INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]; Laplacian Eigenmaps; Animals; Cluster analysis; [SDV.GEN]Life Sciences [q-bio]/Genetics; Models Genetic; business.industry; Pattern recognition; NADH Dehydrogenase; Sequence Analysis DNA; Python (programming language); Mixture model; [INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation; Visualization; 030104 developmental biology; ComputingMethodologies_PATTERNRECOGNITION; Platyhelminths; [INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET]; Programming Languages; Artificial intelligence; [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]; business; computer; Computers in biology and medicine
researchProduct

Dissection of DLBCL microenvironment provides a gene expression-based predictor of survival applicable to formalin-fixed paraffin-embedded tissue

2018

Abstract. Background: Gene expression profiling (GEP) studies recognized a prognostic role for tumor microenvironment (TME) in diffuse large B-cell lymphoma (DLBCL), but the routine adoption of prognostic stromal signatures remains limited. Patients and methods: Here, we applied the computational method CIBERSORT to generate a 1028-gene matrix incorporating signatures of 17 immune and stromal cytotypes. Then, we carried out a deconvolution on publicly available GEP data of 482 untreated DLBCLs to reveal associations between clinical outcomes and proportions of putative tumor-infiltrating cell types. Forty-five genes related to peculiar prognostic cytotypes were selected and their expression …

0301 basic medicine; Oncology; Male; Pathology; Hematologic Malignancies; Biopsy; Datasets as Topic; Predictive Value of Test; Deconvolution; Cohort Studies; Transcriptome; Antibodies Monoclonal Murine-Derived; 0302 clinical medicine; prognosticators; immune system diseases; hemic and lymphatic diseases; Tumor Microenvironment; Cluster Analysis; digital expression analysis; Randomized Controlled Trials as Topic; Paraffin Embedding; Hematology; Oncology; Hematology; Middle Aged; Prognosis; Corrigenda; Progression-Free Survival; Algorithm; Oncology; 030220 oncology & carcinogenesis; Cell-of-origin; Female; Lymphoma Large B-Cell Diffuse; Survival Analysi; Algorithms; Human; Adult; medicine.medical_specialty; Stromal cell; Microenvironment; Formalin fixed paraffin embedded; Prognosi; Reproducibility of Result; Dissection (medical); 03 medical and health sciences; Digital expression analysi; Young Adult; Prognosticator; Predictive Value of Tests; Formaldehyde; Internal medicine; medicine; Humans; Progression-free survival; Gene; Survival analysis; Aged; Tumor microenvironment; Cluster Analysi; Proportional hazards model; business.industry; Gene Expression Profiling; Reproducibility of Results; Computational Biology; Original Articles; medicine.disease; Survival Analysis; Gene expression profiling; 030104 developmental biology; DLBCL; Cohort Studie; Transcriptome; business; Diffuse large B-cell lymphoma; DLBCL microenvironment deconvolution cell-of-origin digital expression analysis prognosticators
researchProduct

Pharmacogenomics of Scopoletin in Tumor Cells

2016

Drug resistance and the severe side effects of chemotherapy necessitate the development of novel anticancer drugs. Natural products are a valuable source for drug development. Scopoletin is a coumarin compound, which can be found in several Artemisia species and other plant genera. Microarray-based RNA expression profiling of the NCI cell line panel showed that cellular response to scopoletin did not correlate with the expression of ATP-binding cassette (ABC) transporters as classical drug resistance mechanisms (ABCB1, ABCB5, ABCC1, ABCG2). This was also true for the expression of the oncogene EGFR and the mutational status of the tumor suppressor gene TP53. However, mutations in the RAS onc…

0301 basic medicine; Pharmaceutical Science; ATP-binding cassette transporter; Drug resistance; Pharmacology; coumarin; Analytical Chemistry; chemistry.chemical_compound; 0302 clinical medicine; Neoplasms; Drug Discovery; ABC-transporter; microarrays; NF-kappa B; ABCB5; Drug Resistance Multiple; Gene Expression Regulation Neoplastic; Molecular Docking Simulation; Drug development; Chemistry (miscellaneous); 030220 oncology & carcinogenesis; herbal medicine; Molecular Medicine; Signal Transduction; Tumor suppressor gene; Protein Array Analysis; Biology; Article; lcsh:QD241-441; 03 medical and health sciences; lcsh:Organic chemistry; multidrug resistance; Cell Line Tumor; Scopoletin; Humans; Physical and Theoretical Chemistry; Transcription factor; Scopoletin; Oncogene; Plant Extracts; Organic Chemistry; Transcription Factor RelA; phytotherapy; 030104 developmental biology; Artemisia; chemistry; Drug Resistance Neoplasm; Pharmacogenetics; Cancer research; ABC-transporter; cluster analysis; coumarin; herbal medicine; microarrays; multidrug resistance; phytotherapy; ATP-Binding Cassette Transporters; cluster analysis; Molecules
researchProduct

Patterns of Eating and Physical Activity Attitudes and Behaviors in Relation to Body Mass Index

2016

The aim of the study was to identify and characterize patterns of psychological and behavioral characteristics in relation to body mass index. In addition, the study examined the associations between the patterns and demographic characteristics, exercise, eating habits, and health-related psychological variables. Participants were 361 Greek adults who were randomly selected and completed self-report questionnaires. The surveys examined demographic characteristics, health-related psychological variables (attitudes and intentions toward exercise and healthy eating, perceived behavioral control, health locus of control, general health, self-control, and body image) and the behaviors of exerci…

0301 basic medicine; Physical activity; physical activity; Healthy eating; asenteet; Overweight; 03 medical and health sciences; BMI; 0302 clinical medicine; Intervention (counseling); medicine; 030212 general & internal medicine; Eating habits; ta315; ta515; 030109 nutrition & dietetics; attitudes; General Medicine; healthy eating; Locus of control; klusterianalyysi; General health; medicine.symptom; Psychology; Body mass index; Clinical psychology; cluster analysis; Psychology
researchProduct

MicroRNA as crucial regulators of gene expression in estradiol-treated human endothelial cells.

2018

Background/Aims: Estrogen signalling plays an important role in vascular biology as it modulates vasoactive and metabolic pathways in endothelial cells. Growing evidence has also established microRNA (miRNA) as key regulators of endothelial function. Nonetheless, the role of estrogen regulation on miRNA profile in endothelial cells is poorly understood. In this study, we aimed to determine how estrogen modulates miRNA profile in human endothelial cells and to explore the role of the different estrogen receptors (ERα, ERβ and GPER) in the regulation of miRNA expression by estrogen. Methods: We used miRNA microarrays to determine global miRNA expression in human umbilical vein endothelial cel…

0301 basic medicine; Physiology; medicine.drug_class; Endothelial cells; Cèl·lules; Down-Regulation; Estrogen receptor; Estrogen receptors; Biology; lcsh:Physiology; Epigenetic regulation; Receptors G-Protein-Coupled; lcsh:Biochemistry; 03 medical and health sciences; Downregulation and upregulation; microRNA; Gene expression; Human Umbilical Vein Endothelial Cells; medicine; Cluster Analysis; Humans; lcsh:QD415-436; Epigenetics; Cells Cultured; Oligonucleotide Array Sequence Analysis; Principal Component Analysis; Receptors d'hormones; lcsh:QP1-981; Estradiol; Gene Expression Profiling; Up-Regulation; Cell biology; Gene expression profiling; MicroRNAs; Metabolic pathway; 030104 developmental biology; Receptors Estrogen; Estrogen; MiRNA
researchProduct

FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases.

2016

The accelerated growth of protein databases offers great possibilities for the study of protein function using sequence similarity and conservation. However, the huge number of sequences deposited in these databases requires new ways of analyzing and organizing the data. It is necessary to group the many very similar sequences, creating clusters with automatically derived annotations useful to understand their function, evolution, and level of experimental evidence. We developed an algorithm called FastaHerder2, which can cluster any protein database, putting together very similar protein sequences based on near-full-length similarity and/or high threshold of sequence identity. We compressed 50…

0301 basic medicine; Protein structure database; Proteomics; Proteome; Sequence analysis; Computer science; computer.software_genre; Sensitivity and Specificity; Set (abstract data type); Evolution Molecular; 03 medical and health sciences; 0302 clinical medicine; Similarity (network science); Sequence Analysis Protein; Genetics; Cluster (physics); Animals; Cluster Analysis; Humans; Cluster analysis; Databases Protein; Molecular Biology; Sequence; Database; Function (mathematics); Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Modeling and Simulation; Data mining; computer; 030217 neurology & neurosurgery; Software; Journal of computational biology : a journal of computational molecular cell biology
researchProduct

Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.

2016

Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are, however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine str…

0301 basic medicine; Quantitative structure–activity relationship; read across; Predictive Clustering Tree (PCT) method; Computer science; 610; 010501 environmental sciences; computer.software_genre; 600 Technik Medizin angewandte Wissenschaften::610 Medizin und Gesundheit; 01 natural sciences; 03 medical and health sciences; Pharmacology (medical); Cluster analysis; 0105 earth and related environmental sciences; Original Research; Alternative methods; Pharmacology; toxicological and structural similarity; business.industry; QSAR; lcsh:RM1-950; non-animal methods; QSAR; readacross; Predictive Clustering Tree (PCT) method; toxicological and structural similarity; Identification (information); Tree (data structure); 030104 developmental biology; Conceptual approach; lcsh:Therapeutics. Pharmacology; Knowledge base; non-animal methods; Data mining; Web service; business; computer; Frontiers in pharmacology
researchProduct

Snapshots of a shrinking partner: Genome reduction in Serratia symbiotica

2016

Abstract. Genome reduction is pervasive among maternally-inherited endosymbiotic organisms, from bacteriocyte- to gut-associated ones. This genome erosion is a step-wise process in which once free-living organisms evolve to become obligate associates, thereby losing non-essential or redundant genes/functions. Serratia symbiotica (Gammaproteobacteria), a secondary endosymbiont present in many aphids (Hemiptera: Aphididae), displays various characteristics that make it a good model organism for studying genome reduction. While some strains are of facultative nature, others have established co-obligate associations with their respective aphid host and its primary endosymbiont (Buchnera). Further…

0301 basic medicine; Serratia; RNA Stability; 030106 microbiology; ved/biology.organism_classification_rank.species; Genomics; Genome; Article; 03 medical and health sciences; RNA Transfer; Gammaproteobacteria; Cluster Analysis; Amino Acids; Model organism; Gene; 030304 developmental biology; Gene Rearrangement; Genetics; 0303 health sciences; Multidisciplinary; biology; Obligate; 030306 microbiology; ved/biology; Bacteriocyte; Gene rearrangement; Gene Expression Regulation Bacterial; biochemical phenomena metabolism and nutrition; biology.organism_classification; Biosynthetic Pathways; RNA Bacterial; 030104 developmental biology; Evolutionary biology; Genes Bacterial; Buchnera; Genome Bacterial
researchProduct